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Abstract. We study the minimum connected sensor cover problem (MIN-CSC) 
and the budgeted connected sensor cover (Budgeted-CSC) problem, both moti¬ 
vated by important applications in wireless sensor networks. In both problems, 
we are given a set of sensors and a set of target points in the Euclidean plane. 
In MIN-CSC, our goal is to find a set of sensors of minimum cardinality, such 
that all target points are covered, and all sensors can communicate with each 
other (i.e., the communication graph is connected). We obtain a constant fac¬ 
tor approximation algorithm, assuming that the ratio between the sensor radius 
and communication radius is bounded. In Budgeted-CSC problem, our goal is to 
choose a set of B sensors, such that the number of targets covered by the chosen 
sensors is maximized and the communication graph is connected. We also obtain 
a constant approximation under the same assumption. 


1 Introduction 

In many applications, we would like to monitor a region or a collection of targets of 
interests by deploying a set of wireless sensor nodes. A key challenge in such appli¬ 
cations is the limited energy supply for each sensor node. Hence, designing efficient 
algorithms for minimizing energy consumption and maximizing the lifetime of the net¬ 
work is an important problem in wireless sensor networks and many variations have 
been studied extensively. We refer interested readers to the book by Du and Wan □H 
for many algorithmic problems in this domain. 

In this paper, we consider two important sensor coverage problems. Now, we in¬ 
troduce some notations and formally define our problems. We are given a set S of n 
sensors in W l . All sensors in S have the same communication range R c and the same 
sensing range R s . In other words, two sensors s and s' can communicate with each other 
if dist(s, s') < R c , and a target point p can be covered by sensor s if dist(p, s) < R s . We 
use D(s, R) to denote the disk with radius R centered at point s. Let D c (s) = D(s. R c ) 
and D s (s) = D(s,R 5 ). 
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Assumption 1 In this paper, we assume that R s /R c can be upper bounded by a con¬ 
stant C = 0(1) (i.e., R s /R c < C). Note that this assumption holds for most prac¬ 
tical applications. Without loss of generality, we can assume that R c = 1. Hence, 
Rs = 0( 1 ). 

The first problem we study is the the minimum Connected sensor covering (MIN- 
CSC) problem. This problem considers the problem of selecting the minimum number 
of sensors that form a connected network and detect all the targets. It is somewhat 
similar, but different from, the connected dominating set problem. We will discuss the 
difference shortly. The formal problem definition is as follows: 

Definition 1. MIN-CSC: Given a set S of sensors and a set V of target points, find a 
subset S' C S of minimum cardinality such that all points in V are covered by the 
union of sensor areas in S' and the communication links between sensors in S' form a 
connected graph. 

In some applications, instead of monitoring a set of discrete target points, we would 
like to monitor a continuous range R, such as a rectangular area. Such problems can be 
easily converted into a MIN-CSC with discrete points, by creating a target point (which 
we need to cover) in each cell of the arrangement of the sensing disks {D s (s)} sG g 
restricted in R (see E) for details). 

The second problem studied in this paper is the Budgeted connected sensor cover 
(Budgeted-CSC) problem. The problem setting is the same as MIN-CSC, except that 
we have an upper bound on the number of sensors we can open, and the goal becomes 
to maximize the number of covered targets. 

Definition 2. Budgeted-CSC: Given a set S of sensors , a set V of target points and a 
positive integer B, find a subset S' C S such that iS 7 < B and the number of points in 
V covered by the union of sensor areas in S' is maximum and the communication links 
between sensors in S' form a connected graph. 


1.1 Previous Results and Our Contributions 

MIN-CSC The MIN-CSC problem was first proposed by Gupta et al. H9l . They 
gave an 0(r In n)-approximation (r is an upper bound of the hop-distance between 
any two sensors having nonempty sensing intersections). Wu et al. El give an 0(r)- 
approximation algorithm, which is best approximation ratio known so far (in terms of 
r). If R s < R c / 2, r = 1 and the above result implies a constant approximation. How¬ 
ever, even R s is slightly larger than R c / 2, r may still be arbitrarily large. We also notice 
that if r = 0(1), we must have R s /R c = 0(1). So Assumption's a weaker assump¬ 
tion than the assumption that r = 0(1). 

MIN-CSC is in fact a special case the group Steiner tree problem (as also observed 
in Wu et al (37)). In fact, this can be seen as follows: consider the communication graph 
(the edges are the communication links). For each target, we create a group which con¬ 
sists for all sensor nodes that can cover the target. The goal is to find a minimum cost 
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tree spanning all groups. QGarg et al II 161 . combined with the optimal probabilistic 
tree embedding 03, obtained an 0(log 3 n) factor approximation algorithm the group 
Steiner tree problem via LP rounding. Chekuri et al. © obtained nearly the same ap¬ 
proximation ratio using pure combinatorial method. 

Our first main contribution is a constant factor approximation algorithm for MIN- 
CSC under Assumption^ improving on the aforementioned results. Our improvement 
heavily rely on the geometry of the problem (which the group Steiner tree approach 
ignores). 

Theorem 1. There is a polynomial time approximation algorithm which can achieve 
an approximation factor O ( C' 1 2 ) for MIN-CSC. Under Assumption^ the approximation 
factor is a constant. 


Budgeted-CSC Recall in Budgeted-CSC, we have a budget B, which is the upper 
bound of the number of sensors we can use and our goal is to maximize the number of 
covered target points. Kuo et al. l26l study this problem under the assumption that the 
communication and the sensing radius of sensors are the same (i.e., R s = R c ). They 
obtained an 0( v A B)-approximation by transforming the problem to a more general con¬ 
nected submodular function maximization problem. 

Recently, Khuller et al. |[24l obtained a constant approximation for the budgeted 
generalized connected dominating set problem , defined as follows: Given an undirected 
graph G (V. E) and budget B, and a monotone special submodular function p] / : 2 V —> 
Z + , find a subset S C V such that |5| < B, S induces a connected subgraph and f(S) 
is maximized. If R s < R c /2 in Budgeted-CSC, the coverage function f(S) (the number 
of targets covered by sensor set S ) is a special submodular function. Hence, we have a 
constant approximation for Budgeted-CSC when R s < R c / 2. When R s > Rc/2, f(S) 
may not be special submodular and the algorithm and analysis in ll24) do not provide 
any approximation guarantee for Budgeted-CSC. 

We note that it is also possible to adapt the greed approach developed by group 
Steiner tree © and polymatroid Steiner tree © to get polylogarithmic approximation 
for Budgeted-CSC. However, it is unlike the approach can be made to achieve constant 
approximation factors, and we omit the details. 

In this paper, we improve the above results by presenting the first constant factor 
approximation algorithm under the more general Assumption [I] 

Theorem 2. There is a polynomial time approximation algorithm which can achieve 
approximation factor of for Budgeted-CSC. Under Assumption |7j the approxi¬ 
mation factor is 0(1). 

1 Notice that the group Steiner tree is edge-weighted but MIN-CSC is node-weighted. However, 
since all nodes have the same (unit) weight, the edge-weight and node-weight of a tree differ 
by at most 1. 

2 / is a special submodular function if (1) / is submodular: f(A U {u}) — f(A ) > f(B U 
{«}) - f(B) for anylcBCf; (2) f(A U X) - f(A) = f{A U B U X) - f(A U B) 

if N(X) U N(B) = 0 for any X, A, B C V. Here, N(X) denotes the neighborhood of X 
(including _Y). 
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Our algorithm is inspired by, but completely different from ffl. In particular, we 
make crucial use of the geometry of the problem to get around the issue required by 
l24j (i.e., the coverage function is required to be special submodular in their work). 

MIN-CSC is closely related to the the minimum dominating set (MIN-DS) and the 
minimum connected dominating set (M IN-CDS) problem. In fact, if the communication 
radius R c is the same as the sensing radius R s , MIN-CSC reduces to MIN-CDS. In 
general graphs, MIN-CDS inherits the inapproximability of set cover, so it is NP-hard 
to approximation MIN-CDS within a factor of plnn for any p < 1 II 141101 , Improving 
upon Klein et al. f25l . Guha et al. lll 811 obtained a 1.35 Inn-approximation, which is the 
best result known for general graphs. 

Lichtenstein et al. |28l proved that MIN-CDS in unit disk graphs (UDG) is NP-hard 
(which also implies that MIN-CSC is NP-hard). The first constant approximation algo¬ 
rithm for the unweighted MIN-CDS problem in UDG was obtained by Wan et al. ll34l . 
This was later improved by Cheng et al. 0, who gave the first PTAS. For the weighted 
(connected) dominating set problem , Ambiihl et al. m obtained the first constant ratio 
approximation algorithms for both problems (the constants are 72 and 94 for MIN-DS 
and MIN-CDSrespectively). The constants were improved in a series of subsequent pa¬ 
pers H21I9I39I36I , Very recently, Li and Jin ll27l obtained the first PTAS for weighted 
MIN-DS and an an improved constant approximation for weighted MIN-CDS in UDG. 
Many variants of MIN-DS and MIN-CDS, motivated by various applications in wireless 
sensor network, have been studied extensively. See El for a comprehensive treatment. 

Budgeted-CSC is a special case of the submodular function maximization problem 
subject to a cardinality constraint and a connectivity constraint. Submodular maximiza¬ 
tion under cardinality constraint, which generalizes the maximum coverage problem, is 
a classical combinatorial optimization problem and it is known the optimal approxima¬ 
tion is 1 — 1/e I30ll4l . Submodular maximization under various more general combina¬ 
torial constraints (in particular, downward monotone set systems) is a vibrant research 
area in theoretical computer science and there have been a number of exciting new de¬ 
velopments in the past few years (see e.g., 01331 and the references therein). The con¬ 
nectivity constraint has also been considered in some previous work 1381261241 . some 
of which we mentioned before. 


2 Preliminaries 

We need the following maximum coverage (MaxCov) in our algorithms. 

Definition 3. MaxCov: Given a universe U of elements and a family S of subsets ofU, 
and a positive integer B, find a subset S' C S such that |<S / 1 < B and the number of 
elements covered by U s^S'S maximized. 

We need to following well known result, by O0l20l . 

Lemma 1 (Corollary 1.1 of Hochbaum et al. [203). The greedy algorithm is a (1— -)- 

approximation for M axCov. 

A closely related problem is the hitting set problem. 
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Definition 4. HitSet: Given a universe U of weighted elements (with weight function 
c U —} R + ) and a family S of subsets of U find a subset H C U such that H Cl S ^ 0 
for all S £ S (i.e., H hits every subset in S) and ft c u minimized. 

The HitSet problem is equivalent to the set cover problem (where the elements and 
subsets switch roles). It is well known that a simple greedy algorithm can achieve an 
approximation factor of Inn for HitSet and the factor is essentially optimal 1141101 . In 
this paper, we use a geometric version of H itSet in which the set of given elements are 
points in R 2 and the subsets are induced by given disks (i.e., each S' € S is the subset 
of points that can be covered by a given disk). Geometric hitting set admits constant 
factor approximation algorithms (even PTAS) for many geometric objects (including 
disks) I2l8l29l32l51 . As mentioned in the introduction, MIN-CSC is a special case of 
the following group Steiner tree (GST) problem. 

Definition 5. GST: We are given an undirected graph G = (V, If c, T) where c : E — > 
Z + is the edge cost function, and T is a collection of subsets ofV. Each subset in J- is 
called a group. The goal is to find a subtree T, such that T l~l S 0 for all S £ J- (i.e., 
T spans all groups) and the cost of the tree X] e eT c e minimized. 

Our algorithm for Budgeted-CSC also needs the following quota Steiner tree (QST) 
problem. 

Definition 6. QST: Given an undirected graph G = (V. E,c,p) (c : E -A Z + is the 
edge cost function, p : V —> Z + is the vertex profit function) and an integer q, find a 
subtree T = argmax TcE j-- £T c {e)<q H2 V £ T P( v i) °f the graph G (T tries to collect 
as much profit as possible subject to the quota constraint). 

Johnson et al. m proposed the QST problem and proved that any o-approximation 
for the fc-MST problem yields an a-approximation for the QST problem. Combin¬ 
ing with the 2-approximation for fc-MST developed by Garg [ 1151 , we can get a 2- 
approximation for the QST problem. 

Lemma 2. These is an approximation algorithm with approximation factor 2 for QST. 


3 Minimum Connected Sensor Cover 

We first construct an edge-weighted graph Q c as follows: If dist( SjS ')< R c , we add an 
edge between s and s' (It is easy to see that Q c is in fact a unit disk graph). Q c is called 
the communication graph. Recall that MIN-CSC requires us to find a set of vertices that 
induces a connected subgraph in the communication graph Q c . 

First, we note that Q c may have several connected components. We can see any 
feasible solution must be contained in a single connected component (otherwise, the 
solution can not induce a connected graph). Our algorithm tries to find a solution in 
every connected component. Our final solution will be the one with the minimum cost 
among all connected component. Note that for some connected component, there may 
not be a feasible solution in that component (some target point can not be covered by 
any point in that component), and our algorithm ignores such component. 
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From now on, we fix a connected component C in Q c . Similar with Wu et al. Il37l . 
we formulate the MIN-CSC problem as a group Steiner tree (GST) problem. Each edge 
e £ G[C] is associated with a cost c e = 1. For each target p £ V, we create a group 

gp {p) =C n D(p,R s ) = {s | s <E C, dist(p, s) < R s }. 

The goal is to find a tree T (in G[C]) such that Tn gp(p) 7 ^ 0 for all p £ V and the cost 
is minimized. We can easily see the GST instance constructed above is equivalent to the 
original Ml N-CSC problem (the cost of the tree T is the number of nodes in T minus 
1). The GST problem can be formulated as the following linear integral program: We 
pick a root r £ C for the tree T (we need to enumerate all possible roots). For each 
edge e £ Q[C], we use Boolean variable x e to denote whether we choose edge e. 


minimize E X e (1) 

eesic) 

subject to x e > 1; for all S C C such that r £ S and 3p , S fl G p = 0; 

eGd(S) 

x e £ {0,1}, Ve £ G[C). 

The second constraint says that for any cut d{S) that separates the root r from any 
group, there must be at least one chosen edge. By replacing x e £ {0,1} with x £ [0,1], 
we obtain the linear programming relaxation of 0 (denoted as Lp-GST). By the duality 
between flow and cut, we can see that the second constraint is equivalent to dictating 
that we can send at least 1 unit of flow from the root r to nodes in gp(p), for each p. 
This flow viewpoint (also observed in the original GST paper |fT 6 l ) will be particularly 
useful to us later. So we write down the flow LP explicitly as follows. We first replace 
every undirected edge e = (u, v ) by two directed arcs ( u , v ) and (v,u). For each p £ V 
and each directed arc (m, v ), we have a variable x v uv indicating the flow of commodity p 
on arc ( u , u). We use y% = x v uv — J2 W x vw t0 denote the net flow of commodity p 
into node v. Then Lp-GST can be equivalently rewritten as the following linear program 
(denoted as Lp-flow): 


minimize E %UV ( 2 ) 

(u,v)eG[C] 

subject to y p v =^2 x p uv - ^ x p vw for all v £ C 

U W 

y p = —1 for all p £ V, 

E Vv> 1 for all p£V, 

vGgp{p) 

y p =0 for all u £ gp (p),u ^ r, 
x p v < x uv for all p £V,u,v £C, 
x p v , y p £ [0, 1 ], for all u, v £ G[C]. 


Approximation Algorithms for the Connected Sensor Cover Problem 


7 


Now, we describe our algorithm. Our algorithm mainly consists of two steps. In the 
first step, we extract a geometric hitting set instance from the optimal fractional solution 
of Lp-flow. We can find an integral solution H for the hitting set problem and we can 
show its cost is at most 0(C 2 OPT). Moreover all sensors in H can cover all target 
points p £ V. In the second step, we extract a Steiner tree instance, again from the 
optimal fractional solution of Lp-flow. We show it is possible to round the Steiner tree 
LP to get a constant approximation integral Steiner tree, which can connect all points 
in H. 

Step 1: Constructing the Hitting Set Problem : 

We first solve the linear program Lp-flow and obtain the fractional optimal solution 
( x uv ,y v ). Let Opt(Lp-flow) to denote the optimal value of Lp-flow. We place a grid 
with grid size l = ^ 2 in the plane (i.e., each cell is a ^ x square). For each p £ V, 
consider the set of sensors gp(p), that is the set of sensors which can cover p. Since 
gp(p) is contained in a disk of radius R s < C, there are at most ^ 0(C 2 ) = 0(1) grid 
cells that may contain some points in gp(p). Since S„egp(p) Vv — 1’ there must be a 
cell (say cl (p)) such that 


E Vv> W/C 2 ) = 12 ( 1 ). ( 3 ) 

vGgp(p)nd(p) 

Now, we construct a geometric hitting set (HitSet) instance (U,T) as follows: Let 
the set of points be U = U p6 -p(gp(p) n cl(p)) and the family of subsets be T = 
{gp(p)}pG"P. The goal is to choose a subset H of U such that gp(p) n H ^ 0 for all 
p £ V (i.e., we want to hit every set in T). Write the linear program relaxation for the 
HitSet problem (denoted as Lp-HS): 


minimize E z u (4) 

u£U 

subject to E z u > 1 for all p £ V, 

uGgp(p) 

z u G [ 0 , 1 ], for all u GU. 

Let Opt (Lp-HS) to denote the optimal value of Lp-HS. We need the following simple 
lemma. The proof can be found in the appendix. 

Lemma 3. Opt(Lp-HS) < O((7 2 0pt(Lp-flow)). 

Bronnimann et al. m, combined with the existence of e-net of size 0 (l/e) for disks 
(see e.g., M3 18 ). showed that we can round the above linear program Lp-HS to obtain an 
integral solution (i.e., an actual hitting set) H C U such that \H\ < O(0pt(Lp-HS)) 
(the connection to e-net was made simpler and more explicit in Even et al. 112 ). Hence, 
\H\ < 0(C 2 OPT). 

Step 2: Constructing the Steiner Tree Problem : Recall that for each p £ V, there 
is a cell cl(p) such that X^egp(p)nd(p) Dv ^ I2(l/C 2 ). Consider the collection A = 
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{cl(p) | p G V} of all such cells (if there is a cell which contains the root r, we exclude 
it from A), from each cell cl G A, we pick an arbitrary point, called the representative 
node v(c\) of cl. From Equation [ 3 ] (i.e., X^e gp (p) nd (p) Vv > f?(l/C 2 )), we can see at 
least L?(l/C 2 ) flow of commodity p that enters cl(p). Now, we reroute such flow to the 
representative u(cl(p)), in order to create a Steiner tree LP. Consider the optimal frac¬ 
tional solution (x uv ,y v ) of Lp-flow. We would like to create another feasible fractional 
solution ( x uv ,y v ) for Lp-flow. 

- (Flow Rerouting) For each node u G gp (p) n cl(p), let x p uv{d{p)) <- x p uv{d ^ p)) + y u . 
In other words, we route the flow excess at node u to node v(c\(j))). After such 
updates, for each u G gp(p) I~l cl (p),u ^ v(c\(p)) we can see the flow excess is 
zero, or equivalently yP = 0. The flow excess at node v(c\(p)) is 

^(ci( P) )= E y p v >W/c 2 ). 

w6gp(p)nd( p ) 

We repeat the above process for all cl G A. 

- By uniformly increasing all variables, we obtain another feasible solution (x uv , y v ) : 

x p v = min {C 2 x p uin 1} and = min {C 2 y%, 1}. 

Then, it is easy to see that > 1 for all cl G A. In equivalent words, at least 1 
unit flow (thinking x% lv as flow value on (u, v)) that enters u(cl(p)). 

Now, consider the Steiner tree problem in Q{C) in which the set of terminals is de¬ 
fined to be Ter = {r}U{u(cl) | cl G A}. Let x e = max^g-p x? v +max pe -p -^(Notice 
that Lp-flow is formulated on directed graphs and Steiner tree is formulated on undi¬ 
rected graphs. Here e is the undirected edge corresponding to directed edges uv and 
vu). It is easy to see that x e is a feasible solution for the following linear program 
relaxation for the Steiner tree problem (denoted as Lp-ST): 

minimize E X e (5) 

eeS[C] 

subject to x e > 1, for all S C C such that r G S and 3d G A, u(cl) ^ S 

e£d(S) 

x e G {0,1}, Ve G G[C]. 

Lemma 4. Opt(Lp-ST) < O(C' 2 0pt(Lp-flow)). 

It is well known that the integrality gap of the Steiner tree problem is a constant 
05) . In particular, it is known that using the primal-dual method (based on Lp-ST) 
in flTl (see also |35] Chapter 7.2]), we can obtain an integral solution x e such that 

x e < 20pt(Lp-ST) < O(C ,2 0pt(Lp-flow)) < 0(C 2 OPT). 

eeg[C] 

Let J be the set of vertices spanned by the integral Steiner tree {x e }- The above discus¬ 
sion shows that \ J\ < 0(C 2 OPT). Our final solution (the set of sensors we choose) is 
Sol = H U J. The feasibility of Sol is proved in the following simple lemma. 
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Lemma 5. Sol is a feasible solution. 

Proof. We only need to show that Sol induces a connected graph and covers all the 
target points. Obviously, H covers all target points, so does Sol. Since J is a Steiner 
tree, thus connected. Moreover, J connects all representatives r>(cl) for all cl C A. II 
consists of only sensors in cl € A. So every sensor in v £ H (say v £ cl) is connected 
to the representative rt(cl). So H U J induces a connected subgraph. □ 

Lastly, we need to show the performance guarantee. This is easy since we have 
shown that both \H\ < 0(C 2 OPT) and |J| < 0(C 2 OPT). So |Sol| = O(C 2 0PT) = 
O(OPT) since C is assumed to be a constant. 


4 Budgeted Connected Sensor Cover 

Again we assume that R c = 1 and If = C. Recall that our goal is to find a subset 
S' C S of sensors with cardinality B which induces a connected subgraph and covers as 
many targets as possible. We first construct the communication graph Q c as in Section[3] 
Again, we only need to focus on a connected component of Q c . Then we find a square 
Q in the Euclidean plane large enough such that all of the n sensors are inside Q. We 
partition Q into small square cells of equal size. Let the side length of each cell be 
l = Denote the cell in the ith row and jth column of the partition as cl,.j. Let 
Vi,j = {u £ S | v £ cl ij} be the collection of sensors in cljj. We then partition these 
cells into k 2 different cell groups CG nfll where k = \2C/l + 1]. In particular, we let 

CG a ,b = {cl i,j | i = a(mod£;), j = 6(nrodfc)} for a £ [&], b £ [&]. 

and V a ,b = S n CG a b be the collection of sensors in CG a ,b- 

With the above value k, we make a simple but useful observation as follows. 

Observation 1 There is no target covered by two different sensors contained in two 
different cells of CG a &. 

Denote the optimal solution of Budgeted-CSC problem as OPT. In this section, we 
present an O (<4) factor approximation algorithm for the Budgeted-CSC problem. 


4.1 The Algorithm 

Lor 0 < a,b < k, we repeat the following two steps, and output a tree T with 0(/i) 
vertices (sensors) which covers the maximum number of targets. Then based on T, we 
find a subtree T with exactly B vertices as our final output. 

Step 1: Reassign profit : The profit p(S) of a subset S C S is the number of targets 
covered by S. p(S) is a submodular function. In this step, we design a new profit func¬ 
tion (called modified profit function) p : S -A Z + for the set of sensors. To some extent, 
p is a linearized version of p (module a constant approximation factor). 
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Now, we explain in details how p is defined. Fix a cell group CG,,/,. |^For the 
vertices in CG,, /,, we use the greedy algorithm to reassign profits of the vertices in V’,,/,. 
Among all vertices in V n j,. we pick a vertex v\ which can cover the most number of 
targets, and use this number as its modified profit p(v 1 ). Remove the chosen vertex and 
targets covered by it. We continue to pick the vertex V2 in V,,j, which can cover the most 
number of uncovered targets. Set the modified profit p(v 2 ) to be the number of newly 
covered targets. Repeat the above steps until all the sensors in V,,./, have been picked 
out. For other vertices v which are not in V a ,b, we simply set their modified profit p(v) 
as 0. See also Algorithm |T| in the appendix. 

Let us first make some simple observations about p and p. We use p(S) to denote 
J2vesP( v )- Fu^L it i s n °t difficult to see that p(S) < p(S) for any subset S C S. 
Second, we can see that it is equivalent to run the greedy algorithm for each cell in 
CG a b separately (due to Observation |TJ. Suppose Si C cl Ci d, S 2 C cl c / t d' where cl C) d 
and cl c ' t d' are two different cells in CG a ^, then p(S 1 U S 2 ) = p(S 1 ) + p(S2) due to 
Observation [I] 

Consider a cell cl Ci d € CG a Let D c ^ = {v\,V2, ■■■,v n } C cl e ,d 0 S, where the 
vertices are indexed by the order in which they were selected by the greedy algorithm. 
Let D l cd = {vi,V2,Ik} be the first i vertices in D cd . By the following lemma, we 
can see that the modified profit function p is a constant approximation to true profit 
function p over any vertex subset V C V a ,b- 

Lemma 6. For a set of vertices V in the same cell cl Cj d € CG a b, such that \ V\ < i, we 
have that p(D l c d ) = p{D l c d ) > (1 - 1 /e)p(V). 

Proof. By the greedy rule, we can see p(D l c d ) = p(D' r d ). By Lemma[lJ we know that 

P( D id) > (! - !/e) max|y| < iP (V). □ 


Step 2: Guess the optimal profit and calculate a tree T : Although the actual profit 
of OPT is unknown, we can guess the profit of OPT (by enumerating all possibilities). 
For each 0 < a, b < k, we calculate in this step a tree T of size at most 4 B, using the 
QST algorithm (see Lemma|2]i. We can show that among these trees (for different a, b 
values), there must be one tree of profit no less than (l — OPT. 

After choosing the best tree T with the highest profit, we construct a subtree T of 
size B based on T as our final solution of Budgeted-CSC. 

We first show that there exists 0 < a,b < k, such that based on the modified profit 
p on CG a ,b* there exists a tree with at most 2 B vertices of total modified profit at least 
p- (l — OPT. We use Tqpt to denote the set of vertices of the optimal solution. 


Lemma 7. There exists a tree Tq in Q c , |Tq| < 2B such thatp[Tf) > 


1 



OPT. 


Then, by the Lemma [2] and Lemma [ 7 ] if we run the QST algorithm (with p as the 
profit function), we can obtain the suitable tree T with at most AB vertices of profit 

For each CG a: /,, we define a modified profit function p a ,b- For ease of notation, we omit the 
subscripts. 
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at least A (l — p(OPT). The pseudocode of the algorithm can be found in the full 
version of the paper. 

Lemma 8. Let T be the tree obtained in Algorithm^ then p(T ) > ^ (l - 1] OPT 

Proof. By Lemma [7j we can obtain a tree T with at most 4 B. We also have p(T) > 
p- (l — i) OPT. Since p(S) > p{S) for any S, we have proved the lemma. □ 


Finally, we construct a subtree T of B vertices based on tree T. This can be done 
using a simple dynamic program, in the same way as J24). Denote the subtree with 
highest total profit as T. We can show the following lemma. The proof is similar to that 
in ll24l and can be found in the appendix. 

Lemma 9. p(T) > | p(T ). 


Use the same dynamic programming algorithm in Khuller et al. ]24i |. we can find T 


from tree T. Combining Lemma 8 and Lemma 9 p(T) > 
_i_OPT > ^ 

12.66(8C 2 +4 % /2C+l) 

Theorem |2] 


( 1 - 1 ) 


_I_OPT = 

8 V e) (2V2C+1) 2 

— ID2C - QPT (When C is large). Thus, we have obtained 


5 Conclusion and Future Work 

There are several interesting future directions. The first obvious open question is that 
whether we can get constant approximations for MIN-CSC and Budgeted-CSC with¬ 
out Assumption[l](it would be also interesting to obtain approximation ratios that have 
better dependency on C). Generalizing the problem further, an interesting future direc¬ 
tion is the case where different sensors have different transmission ranges and sensing 
ranges. Whether the problems admit better approximation ratios than the (more general) 
graph theoretic counterparts is still wide open. 
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A Appendix 

A.l Missing Details from Section [3] 

Lemma[3] Opt(Lp-HS) < O(C 2 0pt(Lp-flow)). 

Proof. Suppose ( x uv ,y v ) is the optimal fractional solution for Lp-flow. Now, we want 
to construct a feasible fractional solution {zu] u ^u for Lp-HS such that Y^ u gu Zu — 
0(C 2 J2uv x uv) = 0(C 2 Opt (Lp-flow)). We simply let 

z u = min{l, C 2 maxj/J}. 


From (|3j, we can easily see z u is a feasible solution for the HitSet problem: 

E z u > E min{l, C 2 yP} > 1 for all p £ V 

uegp(p) «egp(p)nci(p) 

It remains to see that 


E - ^ E c2 m ^y v u ^ ° 2 E E 


u£U 


u£U 


pev . 

u£U \tuEC 


r p 

U wu 


- c2 E E = ° 2 E E Xwu 

u£U w€C ^ u£U w£C 


<c 2 E- 


This finishes the proof. 

LemmaJS] Opt(Lp-ST) < O(C 2 0pt(Lp-flow)). 


□ 
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Proof. Recall x e is a feasible solution for Lp-ST and x uv is the optimal solution for 
Lp-flow. We only need to show that 5Z e eS[C] — 0{C 2 3 4 5 6 7 J2( u v)eg[c] Xuv )■ This can 
be seen as follows: 


£e= max (mm{C 2 xl v , 1}) 

e€Q[C] (u,v)€lQ[C] P (il,v)eG[C] P 

< C 2 ^ max x v uv = C 2 ^ x uv . 

(u,v)&g[C] ^ (u,v)€LS[C] 

This finishes the proof of the lemma. □ 

A.2 Missing Details from Section [4] 


Algorithm 1: The Greedy Algorithm for Computing p 

Input: The sensor collection 5, the target collection V, the cell collection CG aj i,. 

Output: Profit function p : V —> Z + U {0} 

1. for all cf ., £ CG aj t do 

2. Pt <r- V UPt is the set of uncovered targets 

3. V s <— Vi j //Vs is the set of available sensors 

4. while 14| ^ 0 do 

(a) v <— arg max„ ePj | Np t (v)| HNp t ( v ) is the set of uncovered targets that can be 
covered by v. 

(b) p(v) «- \N Pt (v)\, P t «- P t \N Pt (v), 14 «- V s \{v} 

5. end for 

6. end for 

7. return p 


'S 


Lemma 7l There exists a tree T 0 in Q c , 17J, < 2B such that p(T 0 ) > 


k 2 


1- - ) OPT 


Proof. We first notice that 


OPT = pl ^ Topt O CG Q ,b J < ^ p (Topt H CG ai b). 

\0<a,b<k J 0 <a,b<k 

Hence, there exists 0 < a' ,b' < k, such that 

p(Topj 0 CG a , M ) > ^ p(Topt O CG Qi b) > ^OPT. 

0 <.a,b<k 

For any cell cl Ci d £ CG a / {,/, suppose n C: d = |Topt (T cl C)C j|. To is obtained from Topt 
by appending all vertices in l) r f * 1 (recall that If. 'f consists of the first n r ,i vertices 
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selected in cl CJ j by the greedy algorithm). Note that we append at most B vertices in 
total, and all vertices are still connected ( since all vertices in the same cell are connected 
). Thus, T 0 is connected and has at most 2 B vertices. 

By Lemma|6j we can see thatp(L)"^ d ) > (l — |) p(Topt Cl cl Cj( /). Thus, we have 

p(T 0 ) = J2 R D Zd) > f 1 - E P ( T opt n cl c ,d) 

cl CjC ^GCG a) 5 

= (i - p(Topt n CG a , b ) > 1 (l - OPT. 

The second equality holds due to Observation [T] □ 

Lemma |9j p(T) > |p(T). 

Proof. We show how to construct a subtree T of B vertices based on tree T. Our tech¬ 
nique is almost the same as Khuller et al. l24l . Their proof used a theorem by Jordan 
l23l : Given any tree on n vertices, we can decompose it into two trees (by replicating a 
single vertex) such that the smaller tree has at most |" j~\ nodes and the larger tree has at 
most nodes. By apply Jordan’s theorem recursively, they obtained the following 
result: Let B be greater than a sufficiently large constant. Given a tree T with 6 k nodes, 
we can decompose it into 13 trees of size at most B nodes each. 

Our construction is almost the same except that the original tree T in our setting has 
at most 4 B vertices. By Jordan’s theorem, we decompose the tree T into two trees T\ 
and T 2 such that |Ti| < 2 B and |X 2 1 <1 B and continue decomposing until the tree 
has at most k vertices (as shown in the figure. Note that each subtree in the white square 
in the figure has at most B vertices). Thus we can decompose a tree of size 4 B to at 
most 8 subtrees of size at most B. See the following figure. 



16 64 

—B+2 777 Br2 


Suppose the subtrees are Ti,T 2 ,...,Tg. Then we have, 

P(T)>lj2p(T t )>lp(T) 

0 2=1 

So there is a subtree of size at most k and profit at least |p(T). □ 
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Algorithm 2: Algorithm for Budgeted-CSC 
Input: The sensor collection <S, the target collection V, budget B. 

Output: a tree T with T < B. 

1 . Construct the communication graph Q c 

2. for a from 0 to k — 1, b from 0 to k — 1 

(a) Reassign every vertex’s profit using Algorithm[T] 

(b) Set every edge’s cost as 1 

(c) ProfitOpt gues3 <r- 1 

(d) Do 

i. T' Run the 2-approximation algorithm of QST on Q c with quota 
ProfitOptg Uess 

ii. if \T'\ < 4BthenT-h- T' 

iii- ProfitOpt guess = ProfitOpt guesa + 1 

(e) While( | T'| < 4 B) 

3. end for 

4. T use the dynamic programming algorithm described in Section 5.2.2 in (24j to find the 
best profit subtree of size B from T. 

5. return T 





